Picture for Erhan Zhang

Erhan Zhang

UnityMAS-O: A General RL Optimization Framework for LLM-Based Multi-Agent Systems

Add code
May 26, 2026
Viaarxiv icon

Tournament-GRPO: Group-Wise Tournament Rewards for Reinforcement Learning in Open-Ended Long-Form Generation

Add code
May 26, 2026
Viaarxiv icon

PRAISE: Prefix-Based Rollout Reuse in Agentic Search Training

Add code
Apr 04, 2026
Viaarxiv icon

JADE: Bridging the Strategic-Operational Gap in Dynamic Agentic RAG

Add code
Jan 29, 2026
Viaarxiv icon

Beyond Monolithic Architectures: A Multi-Agent Search and Knowledge Optimization Framework for Agentic Search

Add code
Jan 08, 2026
Viaarxiv icon

Leveraging LLMs to Evaluate Usefulness of Document

Add code
Jun 11, 2025
Viaarxiv icon

Exploring Human-Like Thinking in Search Simulations with Large Language Models

Add code
Apr 10, 2025
Viaarxiv icon

RecGPT: Generative Personalized Prompts for Sequential Recommendation via ChatGPT Training Paradigm

Add code
Apr 06, 2024
Viaarxiv icon

USimAgent: Large Language Models for Simulating Search Users

Add code
Mar 14, 2024
Figure 1 for USimAgent: Large Language Models for Simulating Search Users
Figure 2 for USimAgent: Large Language Models for Simulating Search Users
Figure 3 for USimAgent: Large Language Models for Simulating Search Users
Figure 4 for USimAgent: Large Language Models for Simulating Search Users
Viaarxiv icon

Brand Celebrity Matching Model Based on Natural Language Processing

Add code
Aug 18, 2022
Figure 1 for Brand Celebrity Matching Model Based on Natural Language Processing
Figure 2 for Brand Celebrity Matching Model Based on Natural Language Processing
Figure 3 for Brand Celebrity Matching Model Based on Natural Language Processing
Figure 4 for Brand Celebrity Matching Model Based on Natural Language Processing
Viaarxiv icon